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Abstract 

In this paper, a new low-complexity demodulation scheme is proposed for interleaved orthogonal 
frequency division multiple access (OFDMA) downlink system with N subcarriers and M users using 
circular convolution. In the proposed scheme, each user's signal is extracted from the received interleaved 
OFDMA signal of M users by using circular convolution in the time domain and then fast Fourier 
transformed in the reduced size N/M. It is shown that the computational complexity of the proposed 
scheme for the interleaved OFDMA downlink system is much less than that of the conventional one. 

Index Terms 

Circular convolution, low-complexity demodulation, fast Fourier transform (FFT), orthogonal fre- 
quency division multiplexing (OFDM), orthogonal frequency division multiple access (OFDMA). 

I. Introduction 

Orthogonal frequency division multiplexing (OFDM) can achieve high-speed data transmis- 
sion in the multipath fading channel by dividing a high-rate data stream into parallel low-rate 
streams on orthogonal subcarriers JH, ED- If the number of subcarriers and guard interval are 
large enough, channel for each subcarrier becomes approximately frequency flat and high-order 
modulation can be used. Due to robustness of OFDM system against the multipath fading, it 
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has already been adopted in various wireless communication systems such as wireless local area 
network (WLAN) Q, wireless metropolitan area network (WMAN) [41, and 3GPP-LTE 0. 

In addition, orthogonal frequency division multiple access (OFDMA) has been adopted as 
one of multiple access schemes for the wireless communication networks. In OFDMA system, 
time or frequency resources are used to separate multiple user signals. Generally, most of 
users experience low battery life of mobile stations and signal processing schemes for data 
transmission considerably affect the power consumption. Among signal processing schemes 
used in mobile stations, fast Fourier transform (FFT) and inverse fast Fourier transform (IFFT) 
occupy a significant portion of total computational complexity [6J. Therefore, reduction of the 
computational complexity of FFT/IFFT in OFDMA system is highly demanded. 

In OFDMA system, a portion of iV subcarriers is allocated to each user but all users' data on 
N subcarriers are always received by each user, where iV-point FFT is needed [7J. Recently, by 
dividing the iV-point discrete Fourier transform (DFT), a low-complexity demodulation scheme 
is proposed for interleaved OFDMA downlink system flU. However, this scheme can work only 
when the number of subcarriers allocated to each user is small. Since other users' data do not 
need to be exploited for each user, a method to extract the signal of each user from the received 
OFDMA signal is necessary for the reduced size FFT. 

In this paper, a new low-complexity demodulation scheme is proposed for interleaved OFDMA 
downlink system with iV subcarriers and M users using circular convolution. In the proposed 
scheme, each user's signal is extracted from the received interleaved OFDMA signal of M users 
by using circular convolution in the time domain and then fast Fourier transformed (FFTed) in 
the reduced size N/M instead of the size N. It is shown that the computational complexity 
of the proposed demodulation scheme is much less than that of the conventional demodulation 
scheme for the interleaved OFDMA downlink system. 

The rest of the paper is organized as follows. In Section HH the interleaved OFDMA system is 
briefly introduced. In Section [Till u ser signal extraction matrices are proposed for the interleaved 



OFDMA downlink system. In Section IV a new low-complexity demodulation scheme for the 
interleaved OFDMA downlink system using circular convolution is proposed. The computational 
complexity of the proposed demodulation scheme is analyzed in Section |V} Finally, conclusion 
is given in Section fVT) 
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Fig. 1, An interleaved OFDMA of four users with N = 16 in the frequency domain. 



II. Interleaved OFDMA System 

In OFDMA downlink system, a data bit stream is mapped into an input symbol vector X = 
[Xq Xi ■ ■ ■ X N _i] T modulated by PSK or QAM, where N is the number of subcarriers and 
(•) T denotes the transpose of a vector. Note that a high-rate data stream of M users is split into 
N low -rate data streams, which are loaded on the N subcarriers. By IFFTing the input symbol 
vector X, i.e., x = IFFT(X), an OFDMA signal is obtained as 

1 N ^ 

x n =-=Y,X k ef*Tr i , 0<n<N-l (1) 

where j = yf—i and x = [xq X\ • • • xn-i] t is the OFDMA signal vector. Each OFDMA signal 
of M users is transmitted over an independent multipath fading channel. 

Since the interleaved OFDMA downlink system is considered in this paper, we assume that 
the N subcarriers are partitioned into M disjoint interleaved subcarrier groups for M users by 
taking every M th subcarrier. Then, an input symbol vector X is partitioned into M disjoint user 
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(2) 



m=l 



where X m = [X mfi X mA ■ ■ ■ X m>Af _i] T , 1 < m < M, is given as 




Xfc, if k = m — 1 mod M 
0, otherwise. 



(3) 



Clearly, each subcarrier is allocated to only one user and all subcarriers in the mth subcarrier 
group is assigned to the mth user. Thus, for each user, the number of assigned subcarriers is 



Suppose that the received signal vector at the receiver of the mth user is r = [ro r\ ■ ■ ■ rjy^i] T 
in time domain. Then, the received symbol vector R = [R R\ • • • Rn-i] t is obtained by 
iV -point FFT of r. Finally, the mth user can obtain the received user symbol vector R m = 
[R m ,o Rm,i ■ ■ ■ Rm,N~i} T , 1 < m < M, from the received symbol vector R to extract its own 
data. Fig. [2] shows a block diagram of the conventional OFDMA downlink system. 

III. User Signal Extraction Matrix for Interleaved OFDMA Signals 

In this section, a new approach to extract the mth user signal from the interleaved OFDMA 
downlink signal is proposed. 

A. Derivation of User Signal Extraction Matrix 

Let x m = [x m> o x m> \ ■ ■ ■ x rnt N-i} T , 1 < m < M, denote the mth user signal vector which is 
an IFFT output of the mth user symbol vector X m as x m = IFFT(X m ). By using the linear 
property of IFFT, the OFDMA signal vector x can be written as 



Now, we derive the user signal extraction matrix to extract x m from x as follows. First, let 
P m = [P m ,o P m ,r ■ ■P m ,N-i} T be the mth user signal extraction vector, where P m>k G {0, 1}, 
1 < m < M and < k < N — 1. If the subcarrier k is allocated to the mth user, P m ^ = 1 and 
otherwise P m k = 0. Then, the mth user symbol vector X m can be expressed as 



N/M. Fig. [T] shows an interleaved OFDMA of four users in frequency domain. 



M 




(4) 



m=l 




(5) 
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Fig. 2. A block diagram of the conventional OFDMA downlink system: (a) Transmitter of the base station and (b) receiver of 
the mobile station for the mth user. 



where 
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m,0 







P m ,i 







(6) 



• • • P m ,N-l 

is an N x iV diagonal matrix corresponding to the mth user signal extraction vector P m . Then, 
the mth user signal vector x m can be rewritten as 

x m = IFFT(X m ) = QM m X (7) 



where 
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(8) 
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is the IFFT matrix with Wn = e J ' £ . Since x = IFFT(X) = QX, the input symbol vector is 
expressed as 

X = Q x x (9) 

where Q 1 denotes the FFT matrix. From ^ and the mth user signal vector x m can be 
obtained as 

x m = QM m Q- 1 x. (10) 

Therefore, the following mth user signal extraction matrix T m can be used to directly extract 
x m from x: 

T m = QM m Q~ 1 . (11) 
By using the convolution property of IFFT, the user signal extraction matrix T m can be 



expressed by using the user signal extraction vector P m [O. From (10), it is easy to see that 
the mth user signal vector x m can be generated by a circular convolution of the OFDMA signal 
vector x and the IFFTed user signal extraction vector of P m as 

x m = IFFT(P m ) ®iv x (12) 

where ®at denotes the iV-point circular convolution. Let t m = [t m> o £ m ,i ••• t m Ar_i] T = 
IFFT(P m ), that is, 

N-l 

_L V A ■ 2nkn 

t m>n = —= } P m , k e J n . (13) 
Then, the mth user signal vector x m can be rewritten as 

X m = t m Oat x (14) 

and thus 



N-l 

./• 

v=0 

where the subscript n — v is computed by modulo iV 



,n ^ ^ trn,v%n—v (15) 



From ([TT|) and ( JT2| ), it is easy to see that T m is a circulant matrix and t m is the first column 
vector of T m . Then, the mth user signal extraction matrix T m can be rewritten as 

T - ft <0> t <x> t <2> • • • t <iV_1> l (\6) 

J-m — L L m L m L m L m J V lu ^ 



where t< r> is a circularly down-shifted version of the column vector t m by r. From (16), if 
the user signal extraction vector P m is given, the column vector t m and the corresponding user 
signal extraction matrix T m can be determined. 
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B. User Signal Extraction Matrices for Interleaved OFDMA Downlink System 

For OFDMA system, there are several schemes to multiplex x m into x, that is, adjacent, 
interleaved, and random subcarrier allocation schemes ifTOll . However, adjacent and random 
subcarrier allocation schemes are not efficiently implemented by using T m derived in Subsection 
III.A because most of the elements in t m are nonzero. Therefore, the computational complexity 
of the circular convolution between t m and x may be larger than that of IFFT. On the other 
hand, the interleaved subcarrier allocation scheme is suitable for the implementation using T m 
because there are a few nonzero elements in t m . 

The element of the mth user signal extraction vector P m for the interleaved OFDMA system 
is given as 



771, k 



I, k = Mi + m-1 
0, otherwise 



(17) 



where < % < N/M — 1 and 1 < m < M. From (17), the elements of t m for the interleaved 
OFDMA system are derived as 



N-l 



k=0 
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3 N 
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(18) 



0, otherwise 
where < I < M — 1 and < n < N — 1. All the elements in t m for the interleaved subcarrier 



allocation scheme are zero except for M nonzero elements at n = jjl, which take the values 
from { 



N i — -0 \/N i — -1 

— pJ M u Xil. f>J M 1 



Al 



' M 



K e 3 m'( m !)}. Thus, the column vector t m is expressed as 
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(19) 



October 22, 2012 



DRAFT 



8 



For example, if M — 2, 



and if M = 4, 
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(20) 



(21) 



For M = 2 or 4, to generate the mth user signal vector x m , (M — l)N complex additions for the 
circular convolution of t m and x are only required because the multiplication by {±1, ±j} can 
be ignored. However, for M > 4, (M — 1)N complex additions and (M — 4)A^ complex mul- 
tiplications are required for the circular convolution of x and t m if the complex multiplications 

. 27r(m — 1) 



by e ] m 1 e {±1, ±j}, < / < M - 1, are ignored. 



IV. Low-Complexity Demodulation for Interleaved OFDMA Downlink System 

In this section, an algorithm to reduce the computational complexity of FFT for the demod- 
ulation of each user's signal in the interleaved OFDMA downlink system is proposed. 
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A. Subvector Rotation Expression of the mth User Signal Vector 



Since t m has only M nonzero elements, from ( [15] ) and ( |T8[ ), the nth element of the mth user 
signal vector x m is expressed as 

JV-l 

v=0 "' 1=0 

where the subscript of x n _N_i is computed by modulo N. By using the property of circular 
convolution, (122b can be written as 



M 



M-l 

iV \ -\ . 27r(ro-l) . 



Af 



(22) 



M 



, r M-l 

1\ ST - . 27r(m-l) 



. 27r(m— 1) 
2 J A? S Xr 



(23) 



where < s < M—l. Let x r 



x° T x 1 T 



r RI-l T 



1 T 



where xf 



is the sth subvector of the mth user signal vector. From ( [23] ), x m can be expressed as 

T 



X m ,§(s+l)-l\ 



n T ,- 27r(m-l) n 
U A/ ^ U 



X m e 



^•^^(M-l) 



(24) 



where xi 



- 27r(m — 1) 



x„, 1 < s < M - 1 and 1 < m < M. Note that (24) implies that the 



first N/M elements in each user signal vector can be used to generate the other (M — 1)N/M 



elements only through the phase rotation by 



27r(m-l) 
M 



s, 1 < s < M — 1. As a result, the circular 



convolution for the first subvector of the mth user signal vector is only required because the 
remaining subvectors can be obtained directly from the phase rotation of the first subvector. 



Therefore, x m , n in (22) needs to be computed only for < n < N/M — 1. 



B. Low-Complexity Demodulation Scheme 

In an interleaved OFDMA downlink system, each user receives the OFDMA signal conveying 
all M users' data. In general, the mth user symbol vector X m is recovered by performing iV-point 
FFT to the received OFDMA signal vector r in the conventional OFDMA system. However, in the 
proposed low-complexity demodulation scheme for the interleaved OFDMA downlink system, 
the received mth user signal vector r m is extracted from the received OFDMA signal vector r, 
and then iV/M-point FFT is performed to r m to obtain the mth user symbol vector X m as will 
be explained below. Thus, each user does not need to do N -point FFT generating unnecessary 
other users' data. 
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Similar to x m in ([14]), the received mth user signal vector r m can be extracted as 

r m = t m ®tv r (25) 

where r m = [r m r m i • • • r TOi jv_i] T . As described in Subsection IV-A, the first received subvector 
of length N/M in the received mth user signal vector r m = [r^ T r^ T • • • r£f~ lT ] T is 
computed and the remaining received sub vectors r^, 1 < s < M — 1, are obtained through 
the phase rotation of by 27r ^~^ s. It is clear that the BER performance of the interleaved 
OFDMA downlink system is not degraded by the proposed demodulation scheme. 

The kth symbol R m ,k after performing iV-point FFT of r m at the receiver of the mth user is 
expressed as 



N-X 

R m:k = Vr m , n e- J — , 0<fc<iV-l. (26) 

v n=0 

To demodulate the mth user signal, we are only interested in the mth subcarrier group. Therefore, 
we will only consider the symbols Rm,Mi+m-x, < i < N/M — 1, given as 

N-X 

1 ^ — -\ _ . 2ir(Mi+m — l)n 

R m ,Mi+m-X = —7^ 2^ rm ' ne ~ J N • ^ 
n=0 



Similar to (22), the received mth user symbol R m Mi+m-x m <27) can be expressed as 



— — l 

1 ^ A ' . a.(m-l). , 3.(Mi+ m -l)(g.4-) 



1 v v -^-rria ■ 27r(Mi + m-l)n 



s=0 n=0 

MA/ 
r — -v / .2ir(m— l)n\ „■ 'lirin, 

= ~7^2^ \rm,rfi~ 3 ^~) e 3 ^'. (28) 

* n=0 

From (28), it is easy to see that N/M data symbols R m ^ M i+m-x, < i < N/M — 1, of the mth 
user can be obtained by applying iV/M-point FFT. In order to directly perform the iV/M-point 
FFT, the modified mth user signal is defined as 

/ .2ir(m— l)n N 

f m>n = v^e" 3 — r m>Ii) < n < — - 1 (29) 

where r m = [f mfi f m ,x • • • r ril: N__ 1 ] T . Let R m = [R mfi R m>1 ■ ■ ■ R m K-x[ T be the demodulated 
mth user symbol vector consisting of N/M data symbols of the mth user as R m ,i = Rm,Mi+m-x, 
< i < N/M — 1. Then, R m can be obtained by performing iV/M-point FFT to f m as 

R m = FFT(r m ). (30) 
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Fig. 3. A block diagram of the proposed demodulation scheme for the mth user. 



Fig. [3] shows the structure of the proposed demodulation scheme at the receiver of the mth 
user. The proposed demodulation scheme requires the computational complexity to obtain r m 
from r using user signal extraction matrix and the computational complexity of iV/M-point FFT. 
For M = 2 or 4, the computational complexity of circular convolution to obtain r m is given as 
(M—1)N/M complex additions because only the first N/M elements of r needs to be calculated, 
where the complex multiplication by {±1, ±j} is ignored. When M > 4, (M — 1)N/M complex 
additions and (M — A)N /M complex multiplications are required for the proposed demodulation 

■ 2ir(m— 1) 

scheme, where the complex multiplication is ignored if e J m s G {±1, ±j|, < s < M — 1. 
It is clear that the computational complexity of the proposed demodulation scheme is much 
smaller than that of the conventional iV-point FFT, which will be analyzed in the next section. 

V. Computational Complexity Analysis 

In this section, the computational complexity of the proposed demodulation scheme is ana- 
lyzed. In order to recover the user symbol vector in the interleaved OFDMA downlink system, one 
N -point FFT at each receiver is generally required, where the number of complex multiplications 
and additions for the iV-point FFT are (N/2) log 2 N and iVlog 2 N, respectively. In the proposed 
demodulation scheme, the computation to obtain r m from r and one N/M-po'mt FFT operation 
for each user are needed. Table [I] compares the computational complexity of the conventional N- 
point FFT scheme and the proposed demodulation scheme for the interleaved OFDMA downlink 
system, where the multiplication by an element in {±1, ±j} is ignored. 

In order to obtain the received mth user signal vector r m by the proposed user signal extraction 
scheme, circular convolution to calculate the first N/M elements of t m ®at r is required and 
for M = 2 and 4, additional N/2 and 3iV/4 complex additions are required, respectively. When 
M > 4, the total number of complex additions to calculate t m ® N r is still (M — i)N/M, and 
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TABLE I 

Computational complexity of the conventional and proposed demodulation schemes for the 
interleaved ofdma downlink system with jv subcarriers and m users 





Total number of 
complex multiplications 


Total number of 
complex additions 


Conventional N -point FFT 
scheme 




7Vlog 2 iV 


Proposed scheme (M = 2, 4) 


1 AT , JV 
2M u &2 A/ ' M 


N lOE W + t* 1 " 1 )^ 


Proposed scheme (M > 4) 


N los N + 

2M lu &2 M ~ M 


JV , JV , (Af-l)JV 
A/ 10 S2 M + M 



(M — A)N/M complex multiplications are needed if the complex multiplication by e 1 m s g 
{±1, ±j}, < s < M — 1, is ignored. To convert r m to f m , A^/M complex multiplications are 
required. Finally, (N/2M) \og 2 (N / M) complex multiplications and (N/M) log 2 (N/M) complex 
additions are required for iV/M-point FFT computation of r m . Fig. |4] compares the computational 
complexity of the conventional N -point FFT scheme and the proposed demodulation scheme 
for iV = 1024. Fig. [4] shows that the number of complex multiplications and complex additions 
required for the proposed demodulation scheme is much less than that of the conventional Ap- 
point FFT scheme. 

To more practically compare the proposed demodulation scheme with the conventional iV-point 
FFT scheme, the total number of flops are counted. It is known that one complex multiplication 
and one complex addition involve 6 flops and 2 flops, respectively 0TJ. Fig. [5] compares the 
total number of flops for iV-point FFT and the proposed demodulation scheme for iV = 1024. 
From Fig. [5J it is shown that the computational complexity to generate the mth user symbol 
vector is considerably reduced by using the proposed demodulation scheme. 

The computational complexity reduction ratio (CCRR) [[121 of the proposed demodulation 
scheme over the conventional iV-point FFT scheme is defined as 

CCRR =(l- Complexity of the proposed scheme \ 

V Complexity of iV-point FFT J v ' 

The CCRR of the proposed demodulation scheme over iV-point FFT for various iV is given in 

Table [TTJ. which clearly shows that the proposed demodulation scheme is computationally much 
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TABLE II 

CCRR OF THE PROPOSED DEMODULATION SCHEME OVER iV-POINT FFT FOR N = 512, 1024, AND 2048 



N 


512 


1024 


2048 


M 


4 


8 


16 


32 


4 


8 


16 


32 


4 


8 


16 


32 


Complex multiplications 


75 


77.8 


78.5 


78.5 


75 


78.8 


80 


80.3 


75 


79.5 


81.3 


81.8 


Complex additions 


72.2 


81.9 


86.1 


87.8 


72.5 


82.5 


86.9 


88.8 


72.7 


83 


87.5 


89.5 


Flops 


73.9 


79.4 


81.5 


82.2 


74 


80.3 


82.8 


83.7 


74.1 


80.9 


83.75 


84.9 



more efficient than the conventional one. In Table |n[ it is shown that the proposed demodulation 
scheme can reduce the computational complexity by about 80 % compared to iV-point FFT. Also, 
while the demodulation scheme dividing DFT in [8] is useful for small N/M, the computational 
complexity of the proposed demodulation scheme is always much lower than that of iV-point 
FFT regardless of the value of N/M. 

VI. Conclusion 

In this paper, we proposed a new low-complexity demodulation scheme for the interleaved 
OFDMA downlink system using the user signal extraction matrix. It is shown that the column 
vector of user signal extraction matrix for the interleaved subcarrier group consists of N — M 
zero elements and M nonzero elements. Using circular convolution of the received OFDMA 
signal vector and the column vector of user signal extraction matrix in the time domain, the 
received mth user signal vector can be extracted from the received OFDMA signal containing 
M users' data and the A/M-point FFT is applied to the modified mth user signal. Since the 
computational complexity of the proposed demodulation scheme is much lower than that of the 
Appoint FFT without BER degradation, it can be a good candidate demodulation scheme for 
the interleaved OFDMA downlink system. 
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Fig. 4. Comparison of the computational complexity of the conventional TV-point FFT scheme and the proposed demodulation 
scheme for N = 1024 with M = 4, 8, 16, and 32: (a) The number of complex multiplications and (b) the number of complex 
additions. 
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Fig. 5. Comparison of the number of flops in the conventional iV-point FFT scheme and the proposed demodulation scheme 
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